Andersson-5056-1.jpg

100 Circles: A Geometry of The Tree

Privacy Level: Public (Green)

Surnames/tags: connectors 100_circles connection_finder
This page has been accessed 10,426 times.

Contents

Introduction : The Circles of The Queen

This is a family everyone should be somehow familiar with, namely the British Royal Family of the late Queen Elizabeth II Windsor (1926-2022) (hereafter QEII, with due respect).

Most Royal Family close members have a profile in WikiTree, and the application My Connections is showing them, ordered by their distance in "degrees" to QEII. Each "degree" in the "My Connections" page is what we will call here a "circle" of the Queen. The first circle, or degree, gathers 8 people around her : 2 parents, 1 sister, 1 spouse and 4 children. The second circle gathers the parents, spouses, siblings and children of members of the first circle, 41 people altogether. Grandparents, aunts and uncles, nieces and nephews, grandchildren, and in-laws of the initial profile. My Connections is limited to 1000 results and/or ten circles, a limit reached in the middle of the 5th circle for QEII. The WikiTree Browser extension installed allows to break through this limit. The more recent CC7 application (available from the Tree Apps menu) will list the seven first circles in a table.

Following degrees/circles are computed the same way. Those profiles who were already counted in the previous circles should not be counted twice. It needs the smart algorithm of the Connection Finder to achieve that. Computing the shortest path from QEII to any of the 32+ million (as of Feb 2024) connected profiles allows to place each of them in one single circle, in any given state of the data base. For example former US President Barack Obama currently belongs to the 21st circle.

Counting the population of all circles beyond the seven first circles is possible, thanks to a query specially developed by Aleš Trtnik in late 2020 for the 100 Circles project provided we use it sparingly (it's quite heavy on the server).

Do we really get as far as 100 Circles? Almost ... as of Feb 2024, the furthest connected profiles are 87 degrees from QEII, but actually less than 5000 profiles (the so-called "Outer Rim") are further than 60 degrees. Over 99.5% of connected profiles are within 40 degrees, and over 50% within 20 degrees. The following diagram is showing the distribution of circles population as of February 2024.

Several comments on this figure

  • The logarithmic scale allows to see details of the first and last circles.
  • The growth in first circles is quite regular, with a very slight inflexion around C15.
  • The decrease after the peak (most populated circle is C19) is also very regular.
  • After C60, the erratic aspect of the curve corresponds to a very few branches.

An intriguing question is : who are the people in the 20th circle, one of the most populated one? The answer is : just about anyone. Random examples more or less famous : Matilda (Plantagenet) of England (abt.1156-1189) (direct ancestor at 20 generations), Stanley Ann Dunham (1942-1995) (mother of above quoted Barack Obama), Marie-Hélène Abgrall (1854-1905) (my maternal great-grandmother), Charles Joseph Kobloth (1870-1924) (cabinetmaker in Paris), Henrica van Asten (1757-1832) ... The Connection Finder will help you find more of them. Take a distant profile in the Outer Rim table, check his connection to Windsor-1, e.g., Václav Kuneš. The profile at distance 20 from QEII in this path is in her 20th circle. Strike that one off the path, you'll get another one, etc.

In focus : more women !

We are comparing in this section the distribution of circles population for some other reference profiles. The list has changed over time, and in February 2024 women have taken over, famous ones or less so, spanning nine centuries :

Focus ID CC1 CC2 CC3 CC4 CC7 CC10 CC20 Peak Mean e update
Aquitaine-84 17 117 463 1,561 14,411 39,520 1,092,391 25 27.3 89 2024-03-13
Stewart-6849 17 118 524 1,964 27,871 160,491 20,193,592 18 20.0 79 2024-03-25
Bradford-45 29 294 1,975 9,547 330,717 2,329,672 26,123,240 14 16.8 85 2024-03-24
Roy-300 32 290 1,804 8,975 266,896 917,985 21,842,117 17 19.1 87 2024-03-26
De_Beer-124 30 282 1,690 8,649 208,069 539,952 12,947,092 21 21.8 91 2024-03-26
Larsdotter-1579 19 97 410 1,135 5,490 16,746 6,058,791 22 24.4 96 2024-03-26
Lannéval-1 37 201 365 465 1,389 5,068 19,315 30 32.9 100 2024-03-25
Davis-26671 28 138 384 882 11,692 119,829 15,479,948 20 21.5 93 2024-03-27
Gardahaut-1 7 45 137 263 2,51734,436 16,619,440 18 21.2 94 2024-03-27
Coleman-6364 8 30 49 82 646 12,865 13,672,720 19 22.3 93 2024-03-27
Windsor-1 8 49 189 536 9,131 84,911 17,246,067 19 21.0 87 2024-03-27
Luker-573 14 97 428 1,460 45,327 243,616 20,098,638 16 19.6 92 2024-03-27
Haupaix-1 3 16 59 142 2,037 2,691 16,379 33 34.2 100 2024-03-27

Columns CC1 to CC20 give the cumulative population of circles. For example the column CC7 gives the total population of circles up to the 7th, in other words the number of profiles at a maximal distance of 7 degrees from the Focus. This is the same CC7 number as the Connection Counts appearing on member profiles.

Peak value is the distance of the most populated circle. In standard statistics vocabulary, it's called "Mode" of the distribution.

Mean value is the average of all distances to the Focus. Due to the "long tail" of the distribution, the mean value is always slightly greater than the peak value. As a reference, the average distance between two random connected profiles, based on samples constructed using Jamie Nelson's very cool application, can be assessed, as of July 2022, to be in the interval [22-25] with a confidence of 95%. See G2G discussion for more details.

e is the eccentricity of the profile, in other words the distance of the furthest profiles, or the radius of the greatest non-empty circle.

Former featured profiles included :

Other profiles have been previously included at the beginning of the project in 2021 : Marie Mars (1689-1776), Adaline (Carlton) Van Wye (1826-1897), Hermann Alexander von Keyserling (1880-1946) chosen by their respective PM, involved in the original project team.

Shapes of the circles distribution

Are circles distributions really as different as they seem from the above table? The differences are driven by different parameters at different scales. The sum of the populations of all the circles will be the same for all profiles, because it is the total population of the connected Tree. (This is exactly true only if data are retrieved the same day, the Tree is growing by about 10k profiles every day).

  • In the closest circles, the differences are mainly linked to the care taken of the Focus and close relatives by the local PMs, and of course the size of first few circles' families. Large families have potentially over 100 profiles in C2, between 300 and 400 in C3, and over 1,000 in C4. One important part of the work in progress here is to ensure that those first circles are as complete as possible. For our reference Focus Mary Stuart, like for many "Euro Aristos" such a work has already mostly been achieved, hence the great values for C4 and C5. Similar values are expected to be reached for the (far less notable!) Vatant-5 and Andersson-5056, for whom a systematic completion is under way.
  • Differences in C10 and C20 are linked to the distance of the Focus to the (mostly American) bulk of WikiTree. For profiles such as Jean-Joseph Vatant, it takes up to the 15th circle to see a significant growth of the circles population, and the peak circle is currently C28. Shortcuts to the bulk are certainly yet to be discovered. (One important parameter in this case is the very low WikiTree adoption rate in France)
  • In all cases, the population of the peak circle is about the same, a little less than 10% of the total Tree population. When put together on the same diagram, most distributions look globally quite the same, simply more or less shifted to higher distances, and the peak more or less sharp.

Nevertheless, differences appear when looking more closely at a variety of cases. For most profiles under study, the growth to the peak is steeper than the decline to the long tail. In many cases, a secondary "bump" appears beyond the peak, as for Patty (Luker) LaPlante below.

Some cases show at the opposite a remarkable symmetry of the distribution on each side of the peak, such as Emma (Davis) Schipp. The reasons of such differences are not known at this stage.

South African profiles typically show a quick start, up to 100k around C8, then a dip between C10 and C15. This is due to a heavily connected local cluster.

Outer Rim profiles, generally connected through a long and tortuous path and several bottlenecks, display a chaotic distribution over many circles. Of course, such a distribution is likely to collapse drastically whenever a shorcut is found.

How the circles grow

It's raining where it's wet

With WikiTree growth, the circles population is of course changing. The overall population of The Tree is growing at a rate of over 3 million per year. How is this growth distributed over the 100 circles?

The first few circles growth is of course driven by the activity of the local PMs. If they have done a good job, up to the second and third circles can be complete after months of work, and are not subject to significative changes afterwards.

In the "far circles" (C15 and beyond), growth is driven by the global, somehow random, activity of WikiTree, independent of local growth. Since it's always raining where it's wet, most of the growth happens in the already most populated circles. Moreover, new reconnections keep bringing back profiles down to lower circles, making the overall distribution sharper and sharper with time.

As an illustration, the following plot compares the circles population for Jean-Joseph Vatant in November 2020 and November 2022.

In this global picture the work done during the same period to systematically populate the first circles (up to C4) is just invisible. Actually this work had practically no impact on the global distribution, given the strong endogamy of those first circles. The few reconnections with a visible global impact have happened beyond C10.

Seven circles ... and beyond

Léa Haupaix, featured as our last reference profile, has joined WikiTree in January 2021 and her branch was first connected to the Big Tree a few months later. As for many French people, finding connection paths is not obvious, and the first ones are generally going through tortuous bottlenecks. With time, and a hard work on the first circles to grow her CC7, more connections have been found, and Léa's mean distance which was over 38 in 2021 has fallen below 35. Still far above the average, but following the good direction!

The following plot compares Léa's circles population since July 2021. The progress on first circles is well visible thanks to the logarithmic scale, as well as the progressive reduction of the gap around the 10th circle, and the global shift of the curve towards smaller values. Note also the chaotic behaviour of the "long tail" of profiles beyond C70, which would look the same for any reference profile, see The Outer Rim of the Global Tree for more on this.

For other examples, see also the 2020 page Bridges from Sweden, which turned out as a comparison between Olof Andersson and four other profiles, has been updated to show the distribution changes in the circle populations of the same five profiles - showing the contrast between profiles affected only by the general growth of the Big Tree and profiles where "reconnections" have been actively worked on.

A slow but steady "collapse" of circles

For all studied reference profiles the peak of the distance distribution is getting sharper and shifted to lower values. The following table shows the changes in mean distance over 3 years for a few reference profiles.

Jan 21 Jan 22 Jan 23 Jan 24
Samuel Lothrop 17.5 17.3 17.1 16.9
Mary Stuart 20.4 20.2 20.2 20.0
Elizabeth II Windsor 22.0 21.8 21.4 21.1
Olof Andersson 32.0 30.0 25.0 24.5
Jean-Joseph Vatant 33.1 32.4 30.9 30.6

There is of course a limit to the growth of each circle, hence to the "collapse" of the distribution towards lower circles, but this limit is far from reached, for three main reasons :

  • Most connected profiles have not completed even their first circle : parents, siblings, spouse(s) and children are still missing.
  • Each new reconnection pulls profiles to lower circles, and the whole distribution to lower distances. Such collapse events can be spectacular for profiles still at far distances from the bulk of the Tree.
  • Many people in the first circles are yet to be born, or not yet included because they are too recent, for privacy reasons.

This last point is fascinating to consider. For example the 12 great-grandchildren of QEII are not yet in WikiTree, and when they are they will add to her C3. Later on, hopefully, their children will add to C4 by 2040, and their grandchildren to C5 by 2070, the sixth generation will be born somewhere around the turn of next century. With a mean of 3 generations by century, C9 members will be added by 2200, etc. Even Mary Stuart, who might look quite ancient to most of us, has still more than two centuries to wait to see the last members of her 20th circle come to life. QEII belongs to her 12th circle only ...

The shape of things to come : Billion Profiles Single Tree

Based on current trends, one can conjecture what the circles distribution would look like when the Single Tree has passed the billion threshold. Such a figure seems a bold hypothesis, but keeping the current annual growth rate of about 15%, this would be achieved in less than 30 years! In the Billion Profiles Single Tree, it can be conjectured that for most connected profiles, the distribution would converge towards having a similar shape, with a peak in the 15-20 range, and a mean distance below 20. Differences would simply came from the population of the very first circles, Samuel Lothrop case would certainly stay as a reference of sharp growth. One can wonder how such figures are possible, but various estimations of the issue, based on the mean size of families, indicate a ballpark estimation of tenfold growth every 2 circles in the ascending part of this asymptotic distribution, once the 10,000 threshold is passed, somewhere around the 6th circle. A tenfold growth every 2 circles would give this kind of figures for cumulated population :

  • CC6 : 10,000
  • CC8 : 100,000
  • CC10 : 1M (figure already passed by Samuel Lothrop)
  • CC12 : 10M
  • CC14 : 100M ...

... reaching half a billion around C15, which would be the peak circle (as it already is in Lothrop distribution)

All this is of course highly conjectural, but indicates that the shape of things to come is somehow already visible in the current state of affairs. But let the Tree grow, and let's keep gathering data!


Bottom line

One can ask what all this can bring to the WikiTree experience, the genealogist work and everyday life. Here come a few answers (non-exhaustive list).

  • Shifting viewpoint from "my ancestors", "my family", "my blood" to a more peripheral vision, towards an attention to all four directions of kinship (parents, children, siblings, spouses).
  • Systematic exploration and expansion of the first circles is likely to reach and go over your zone of genealogical comfort, discovering unexpected alliances and filiations, traveling further across time, space and social boundaries.
  • As the original Jean-Joseph Marie Vatant and Olof Andersson cases show, even very endogamic first circles eventually expand towards every corner of The Tree. Checking how and when this happens is a good lesson in history of migrations and exogamy.

See also

Free-space pages:

G2G discussions:

External links:





Collaboration
  • Login to request to the join the Trusted List so that you can edit and add images.
  • Private Messages: Contact the Profile Managers privately: Bernard Vatant and Eva Ekeblad. (Best when privacy is an issue.)
  • Public Comments: Login to post. (Best for messages specifically directed to those editing this profile. Limit 20 per day.)
  • Public Q&A: These will appear above and in the Genealogist-to-Genealogist (G2G) Forum. (Best for anything directed to the wider genealogy community.)


Comments: 14

Leave a message for others who see this profile.
There are no comments yet.
Login to post a comment.
I was happy to see Emma (Davis) Schipp retained. Thank you. I can say her CC1 is complete and I have resumed work on her CC2 with many more profiles to be created yet. Will be interesting to see how these additions impact her subsequent circles and distribution. Thanks for your work in this space. Regards, Susan
posted by Susan Stopford
With such an impressive C1, and the quality of your work around her profile, there was really no reason not to keep her in focus. Looking forward indeed for the expansion of the following circles.
posted by Bernard Vatant
Thank you for the kind words. I do put a lot of effort into each profile I work on. Incredibly, I do not think I will be able to complete Emma’s CC2 , at least for a while . Although she was born almost 162 years ago (in 1862), the records for some of her CC2s are still embargoed for privacy reasons under Australian law.
posted by Susan Stopford
I'm not surprised, similar situation here in France. My great-grandmother Catherine Penanguer (1854-1944) was born 8 years before Emma, but her youngest greatchildren (including my father) were born in 1923, and one of them, Joseph Marie Conan (1923-2022) lived to almost 100. And when I created his profile back in 2020 he was "Unlisted" and I had no access to his birth record.
posted by Bernard Vatant
It looks like Jim Loden has surpassed Patty (Luker) LaPlante as the most connected member in the section above in Other reference profiles. I didn't want to make the change in case this was not updated for some reason I am not aware.
posted by Kent Smith
Hi Kent. I've been aware of the situation for quite a while. But with due respect to Jim's awesome achievement, I kept Patty as a reference profile for at least two reasons : follow-up of figures, and gender balance in the table (six women, six men).

Moreover, whenever or if ever the issue of adopted vs biological parents is solved, the two profiles of Patty will be merged, and adding connections of both her biological and adoptive parents should explode her figures well over those of Jim.

For the record, the circles distribution of Jim is quite similar to Patty's one, current figures are a peak at d=17, and a CC20 just above 19M.

posted by Bernard Vatant